Abstract: The problem of distributed dynamic state estimation
in wireless sensor networks is studied. Two important
properties of local estimates, namely, the consistency and 
confidence, are emphasized. On one hand, the consistency, 
which means that the approximated error covariance is lower 
bounded by the true unknown one, has to be guaranteed so that 
the estimate is not over-confident. On the other hand, since the 
confidence indicates the accuracy of the estimate, the estimate 
should be as confident as possible. We first analyze two different 
information fusion strategies used in the case of information 
sources with, respectively, uncorrelated errors and unknown but 
correlated errors. Then a distributed hybrid information fusion 
algorithm is proposed, where each agent uses the information 
obtained not only by itself, but also from its neighbors through 
communication. The proposed algorithm not only guarantees 
the consistency of the estimates, but also utilizes the available 
information sources in a more efficient manner and hence 
improves the confidence. Besides, the proposed algorithm is 
fully distributed and guarantees convergence with the sufficient 
condition formulated. The comparisons with existing algorithms 
are shown. 

I. Introduction

One of the most fundamental properties that an estimator
should have is consistency [1], [2]. That is, the approximated
error covariance of an estimator should be lower bounded by
the true error covariance. The approximated error covariance
of an inconsistent estimate does not actually indicate its
uncertainty [3]. This causes performance issues when the
estimate is further used in downstream functions (data
association, etc.). Consistency is especially important in the
information fusion of distributed sensor networks. When fusing
information sources with unknown correlations, simply ignoring
the unknown correlations might cause inconsistency in estimation.
The Covariance Intersection (CI, [2]) algorithm is proposed
to guarantee the consistency of estimates in the fusion
process without knowing the cross-correlations between
information sources. On the other hand, although an estimate
should be consistent, it should not be too conservative. An
over-conservative estimate indicates less confidence, which
further indicates that the estimate is less useful. Therefore, it
is important to maximize the confidence while maintaining
the consistency of the estimates.

Related work. Ref. [4] considers the consensus algorithm
from the Kalman filters' perspective. The algorithm is further
modified in [5] so that unbiased estimates can still be obtained
even if the outflows of each agent are not equal to each
other. Both algorithms in [4] and [5] are proposed to estimate
a static state. Ref. [6] proposes the well-known Kalman-consensus
filter (KCF), which combines the consensus filter
and the local Kalman filter, and estimates the state of a linear
dynamic process. The optimality and stability of the KCF
are analyzed in [7]. The KCF requires the assumption that
each agent and its neighbors have joint observability of the
state of interest. If a state of interest is observed by neither
a certain agent nor any of its local neighbors, this agent is
referred to as being naive about the state [8], [9]. An example of
camera networks is shown in [8], where the generalized KCF
(GKCF) is proposed, and is shown to outperform the KCF
especially when there exist naive agents. The same situation
is considered in [9], where the information-consensus filter
(ICF) is proposed, and is shown to asymptotically approach
the centralized Kalman filter (CKF) when there are infinitely
many communication steps in between local updates. Recently,
Ref. [10] proposes an innovative approach by achieving
consensus on the local probability density functions (PDFs),
referred to as the "Kullback-Leibler average" (KLA) therein. It
is analytically shown in [10] that each agent will eventually
achieve bounded approximated estimate error covariance if
the graph is strongly connected, even if each agent communicates
with its neighbors only once before updating its
local estimate.

Contributions. While the current literature primarily
focuses on the recovery of centralized estimators or the
estimate errors of distributed algorithms, in this paper, we
emphasize the importance of the consistency and confidence
of estimates. Considered from the perspective of
information fusion, a distributed hybrid information fusion
(DHIF) algorithm is proposed. In the proposed DHIF, each
agent cooperates with its neighbors, utilizes all available
information sources, and comes up with its local estimate
that is consistent while relatively confident. We focus on
a more realistic scenario where only one communication
step is allowed between neighbors before they update their
local estimates. The proposed algorithm is robust against
naive agents and does not require any global parameters.
The sufficient condition for bounded approximated local
estimate error covariance is formulated. The comparisons
with existing algorithms are shown.

II. Preliminaries 

A. Notations 

Throughout this paper, we use the following notations. $\mathbf{1}$ is
an all-one vector with appropriate dimension. $\mathbf{0}$ is an all-zero
matrix with appropriate dimension. $I_n$ is the $n \times n$ identity
matrix. $\mathrm{E}[\cdot]$ denotes the expectation of a random variable. $\mathbb{Z}^*$
is the set of non-negative integers. $\mathbb{Z}^+$ is the set of positive
integers. For any arbitrary square matrices $X_1$ and $X_2$ with
the same dimensions, $X_1 \succeq X_2$ (respectively, $X_1 \succ X_2$) implies
that $X_1 - X_2$ is a positive semi-definite (respectively, positive
definite) matrix. $X_1 \geq X_2$ (respectively, $X_1 > X_2$) implies that
$X_1 - X_2$ is a non-negative (respectively, positive) matrix. For
an arbitrary matrix $X$, $\{X\}_{(i,j)}$ is the entry on its $i$th row,
$j$th column. $\mathrm{tr}(X)$ is the trace of $X$. When we refer to $X \succeq 0$
as a large/small matrix, we mean X is large/small in the 
sense of the mean squared error (MSE). If X is nonsingular, 

X-nT A fQj. „ g 2 *. 

B. Graph Theory 

A directed graph $\mathcal{G}(\mathcal{V},\mathcal{E})$ is used to represent the
communication topology of a large-scale sensor network, where
$\mathcal{V} = \{1,\dots,N\}$ and $\mathcal{E} \subseteq \mathcal{V} \times \mathcal{V}$ are, respectively, the set of
$N$ vertices that stand for the local agents, and the set of
edges that stand for the communication channels. Several
basic concepts of graph theory used later in this paper are
briefly listed here. An edge $(i,j) \in \mathcal{E}$ denotes that agent $j$ can
receive information from agent $i$. If a graph is undirected,
$(i,j) \in \mathcal{E}$ implies that $(j,i) \in \mathcal{E}$. A directed path from agent
$i_0$ to agent $i_\ell$ is a sequence of vertices $i_0, i_1, \dots, i_\ell$ such that
$(i_{j-1}, i_j) \in \mathcal{E}$ for $0 < j \leq \ell$. A directed (respectively,
undirected) graph is strongly connected (respectively, connected)
if there exists at least one path from every vertex to every
other vertex. $\mathcal{N}_i = \{j \mid (j,i) \in \mathcal{E},\ \forall j \neq i\}$ is the neighborhood
of agent $i$. $\mathcal{J}_i = \mathcal{N}_i \cup \{i\}$ is the inclusive neighborhood of agent
$i$. A graph is complete if $j \in \mathcal{N}_i$ for any $i$ and $j$. The in-degree
of a certain agent $i$ is defined as $\Delta_i = |\mathcal{N}_i|$. The maximum
in-degree of a graph is defined as $\Delta_{\max} = \max_{i \in \mathcal{V}} \Delta_i$.
A directed spanning tree is a subgraph of $\mathcal{G}(\mathcal{V},\mathcal{E})$ such
that this subgraph is a directed tree and contains all vertices
of $\mathcal{G}(\mathcal{V},\mathcal{E})$. For a graph $\mathcal{G}(\mathcal{V},\mathcal{E})$, an associated stochastic
matrix $D$ is defined as follows. For each $i \in \mathcal{V}$, $\{D\}_{(i,j)} > 0$ if
$j \in \mathcal{J}_i$ and $\{D\}_{(i,j)} = 0$ otherwise. Moreover, $\sum_j \{D\}_{(i,j)} = 1$, $\forall i$.
If it also holds that $\sum_i \{D\}_{(i,j)} = 1$, $\forall j$, then $D$ is a doubly
stochastic matrix.

Definition 1 (Induced family of stochastic matrices): Let
$D$ be a stochastic matrix associated with the graph $\mathcal{G}(\mathcal{V},\mathcal{E})$.
Then the family of stochastic matrices induced by $D$ is
the set of all possible stochastic matrices associated with
$\mathcal{G}(\mathcal{V},\mathcal{E})$.

C. Discrete-time Average Consensus Algorithm 

An average consensus algorithm computes the global average
of some variables of interest, through communication
with only local neighbors. Suppose that $a_i[0]$ is the initial
value of agent $i$'s local variable. The objective is to compute
$\frac{1}{N}\sum_{i=1}^{N} a_i[0]$ in a distributed manner. Suppose that the graph
is undirected. The local algorithm implemented at agent $i$ has
the form $a_i[k+1] = a_i[k] + \epsilon \sum_{j \in \mathcal{N}_i} (a_j[k] - a_i[k])$, $\forall k \in \mathbb{Z}^*$,
where $\epsilon \in (0, 1/\Delta_{\max})$ is the rate parameter. By iteratively
updating the local variable value in this manner, the global
average can be asymptotically achieved if and only if the
graph is connected. More details about the average consensus
algorithm can be found in [11].
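The consensus iteration described above can be sketched in a few lines of Python. This is only an illustration under assumed data: the path graph, the initial values and the rate parameter 0.4 are hypothetical choices, not taken from the paper.

```python
def average_consensus(values, neighbors, eps, steps):
    """Iterate a_i[k+1] = a_i[k] + eps * sum_{j in N_i} (a_j[k] - a_i[k])."""
    a = list(values)
    for _ in range(steps):
        # Every agent updates synchronously from the previous iterate.
        a = [a[i] + eps * sum(a[j] - a[i] for j in neighbors[i])
             for i in range(len(a))]
    return a

# Hypothetical undirected path graph 0 - 1 - 2 - 3 (maximum degree 2).
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
initial = [1.0, 5.0, 3.0, 7.0]
eps = 0.4  # assumed rate parameter, inside (0, 1/2)
result = average_consensus(initial, neighbors, eps, steps=200)
```

Because the graph is connected and the rate parameter lies in the admissible interval, every entry of `result` approaches the average of `initial`, which is 4.0 here.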

D. Linear Information Fusion 

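Consider a set of $p$ information sources, denoted
as $\{a_i\}_{i=1}^{p}$, where each $a_i \in \mathbb{R}^{m_i}$ is the sum of a quantity
linearly related to the same unknown parameter of interest
$a \in \mathbb{R}^{n}$, and some unknown noise/errors with zero mean.
That is, $\mathrm{E}[a_i] = C_i a$ with $C_i \in \mathbb{R}^{m_i \times n}$, $\forall i$. For each $a_i$,
assume that the noise/error covariance, denoted as $\tilde{R}_{a_i} \triangleq
\mathrm{E}[(a_i - \mathrm{E}[a_i])(a_i - \mathrm{E}[a_i])^T]$, is positive definite and might be
unknown, but a consistent ([1], [2]) approximation, denoted
as $R_{a_i}$, is known, i.e., $R_{a_i} \succeq \tilde{R}_{a_i} \succ 0$.

The objective is to linearly fuse $\{a_i\}_{i=1}^{p}$ and obtain an
unbiased consistent estimate of $a$, i.e., $\hat{a} = \sum_{i=1}^{p} K_i a_i$, where
$K_i \in \mathbb{R}^{n \times m_i}$, $\forall i$ are weighting matrices to be selected. Note
that $\sum_i K_i C_i = I_n$ due to the required unbiasedness of $\hat{a}$. Let
$\tilde{R}_{\hat{a}} = \mathrm{E}[(\hat{a} - \mathrm{E}[\hat{a}])(\hat{a} - \mathrm{E}[\hat{a}])^T]$ be the true error covariance
of $\hat{a}$. Let $\tilde{R}_{a_{ij}} = \mathrm{E}[(a_i - \mathrm{E}[a_i])(a_j - \mathrm{E}[a_j])^T]$. It follows that

$$\tilde{R}_{\hat{a}} = \sum_i K_i \tilde{R}_{a_i} K_i^T + \sum_i \sum_{j \neq i} K_i \tilde{R}_{a_{ij}} K_j^T. \qquad (1)$$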

When $\tilde{R}_{a_i}$ and $\tilde{R}_{a_{ij}}$ are unknown, an approximation of $\tilde{R}_{\hat{a}}$ is
required. This is when the problem becomes tricky. On one
hand, the consistency of the estimate has to be guaranteed
since an inconsistent estimate is over-confident and the
corresponding approximated error covariance does not
realistically indicate the uncertainty [3]. On the other hand, the
error covariance is a measure of estimate uncertainty: for a
consistent estimate, a smaller approximated error covariance
indicates more confidence. In the rest of this paper, we
assume that when fusing information sources, the objective
is to minimize the trace of the fused error covariance. Two
situations are considered.

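1) Fusion of information with uncorrelated errors: When
the information sources have mutually uncorrelated errors,
i.e., $\tilde{R}_{a_{ij}} = 0$, $\forall i \neq j$, it follows from (1) that $\tilde{R}_{\hat{a}} = \sum_i K_i \tilde{R}_{a_i} K_i^T$.
Thus a reasonable approximation of $\tilde{R}_{\hat{a}}$ could be $R_{\hat{a}} =
\sum_i K_i R_{a_i} K_i^T$. Note that if $R_{a_i} \succeq \tilde{R}_{a_i}$, $\forall i$, it is guaranteed that
$R_{\hat{a}} \succeq \tilde{R}_{\hat{a}}$. The optimal $R_{\hat{a}}$ and $\hat{a}$, denoted as, respectively,
$R_{\hat{a}}^*$ and $\hat{a}^*$, can be obtained by minimizing $\mathrm{tr}(R_{\hat{a}})$ with
$\sum_i K_i C_i = I_n$ being held to guarantee the unbiasedness.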

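Lemma 1: Let $K_i^*$ be the optimal weighting matrices such
that $\mathrm{tr}(\sum_i K_i R_{a_i} K_i^T)$ is minimized with $\sum_i K_i C_i = I_n$
satisfied. Let $\hat{a}^* \triangleq \sum_i K_i^* a_i$ and $R_{\hat{a}}^* \triangleq \sum_i K_i^* R_{a_i} K_i^{*T}$. Then, $K_i^* =
(\sum_j C_j^T R_{a_j}^{-1} C_j)^{-1} C_i^T R_{a_i}^{-1}$, $\forall i$ and

$$R_{\hat{a}}^* = \Big(\sum_i C_i^T R_{a_i}^{-1} C_i\Big)^{-1}, \qquad \hat{a}^* = R_{\hat{a}}^* \Big(\sum_i C_i^T R_{a_i}^{-1} a_i\Big). \qquad (2)$$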

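Proof: Let $\Lambda \in \mathbb{R}^{n \times n}$ be the Lagrangian multiplier.
Construct the following augmented function

$$L(K_1,\dots,K_p,\Lambda) \triangleq \mathrm{tr}\sum_i K_i R_{a_i} K_i^T + \sum_{j,k} \{\Lambda\}_{(j,k)} \Big\{\sum_i K_i C_i - I_n\Big\}_{(j,k)}.$$

By setting the partial derivative with respect to $\Lambda$ and each
$K_i$ to zero, it follows that $K_i^* = -\frac{1}{2}\Lambda^* C_i^T R_{a_i}^{-1}$, $\forall i$ and $\Lambda^* =
-2(\sum_i C_i^T R_{a_i}^{-1} C_i)^{-1}$. Accordingly, $\hat{a}^*$ and $R_{\hat{a}}^*$ can be obtained
by their definitions. ■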


In summary, the fusion steps given by (2) preserve the
consistency, and yield the optimal estimate. Note that if $R_{a_i} =
\tilde{R}_{a_i}$, $\forall i$, one can obtain $R_{\hat{a}}^* = \tilde{R}_{\hat{a}^*}$. However, this only holds
when the information sources have mutually uncorrelated
errors.
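As an illustration of the information-form fusion in (2), the following sketch treats the scalar case, where each source is a triple of a scalar observation gain, an approximated error variance and an observed value. The three sources are hypothetical numbers chosen only for the example.

```python
def fuse_uncorrelated(sources):
    """Fuse sources (C_i, R_i, a_i) with mutually uncorrelated errors, scalar case of (2)."""
    info = sum(c * c / r for c, r, _ in sources)      # sum_i C_i^T R_i^{-1} C_i
    info_vec = sum(c * a / r for c, r, a in sources)  # sum_i C_i^T R_i^{-1} a_i
    r_star = 1.0 / info                               # fused approximated variance
    return r_star * info_vec, r_star

def optimal_gains(sources):
    """Weights K_i^* of Lemma 1 in the scalar case."""
    _, r_star = fuse_uncorrelated(sources)
    return [r_star * c / r for c, r, _ in sources]

# Hypothetical sources: (gain C_i, approximated variance R_i, observed value a_i).
sources = [(1.0, 4.0, 10.2), (1.0, 1.0, 9.9), (2.0, 2.0, 20.4)]
a_star, r_star = fuse_uncorrelated(sources)
gains = optimal_gains(sources)
```

The gains satisfy the unbiasedness constraint (their products with the corresponding `C_i` sum to one), and the fused variance is smaller than the variance any single source could achieve on its own.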

2) Fusion of information with unknown correlated errors:
When the information sources have unknown but correlated
errors, the last term in (1) is nonzero and unknown. In such
a case, if one approximates $\tilde{R}_{\hat{a}}$ with $R_{\hat{a}}$ as in the previous
case, it is not guaranteed that $R_{\hat{a}} \succeq \tilde{R}_{\hat{a}}$ even if $R_{a_i} \succeq \tilde{R}_{a_i}$, $\forall i$.
The Covariance Intersection (CI) algorithm is proposed in
[2], where the estimate and approximated error covariance
of $a$, denoted as, respectively, $\hat{a}_c$ and $R_c$, are as follows:

$$R_c = \Big(\sum_i \omega_i C_i^T R_{a_i}^{-1} C_i\Big)^{-1}, \qquad \hat{a}_c = R_c \Big(\sum_i \omega_i C_i^T R_{a_i}^{-1} a_i\Big), \qquad (3)$$

where $\omega_i \geq 0$, $\forall i$ are the weights satisfying $\sum_i \omega_i = 1$. The
CI algorithm is able to preserve the consistency of the
information sources (see [2] for detailed proofs), for any
possible unknown correlated errors. However, the optimality
is traded in exchange for guaranteeing the consistency. As a
result, the CI algorithm could be too conservative especially
when the information sources are less correlated.
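The conservativeness of CI can be seen in a scalar sketch of (3). The two sources and the equal weights below are hypothetical; the comparison value is what the uncorrelated-error fusion would report for the same sources.

```python
def covariance_intersection(sources, weights):
    """Scalar CI: sources are (C_i, R_i, a_i); weights are omega_i >= 0 summing to one."""
    assert abs(sum(weights) - 1.0) < 1e-12
    info = sum(w * c * c / r for w, (c, r, _) in zip(weights, sources))
    info_vec = sum(w * c * a / r for w, (c, r, a) in zip(weights, sources))
    r_c = 1.0 / info
    return r_c * info_vec, r_c

# Hypothetical sources: (gain C_i, approximated variance R_i, observed value a_i).
sources = [(1.0, 2.0, 1.0), (1.0, 3.0, 2.0)]
a_c, r_c = covariance_intersection(sources, [0.5, 0.5])

# Variance that would be reported if the errors were assumed uncorrelated.
r_uncorrelated = 1.0 / sum(c * c / r for c, r, _ in sources)
```

With these numbers the CI variance is 2.4 while the uncorrelated fusion reports 1.2: the CI result stays consistent under any correlation, at the price of lower confidence.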

III. Problem formulation 

Throughout this paper, we use the sub-index $k \in \mathbb{Z}^+$ to denote
each variable at the $k$th time instant. Consider the following
linear time-invariant (LTI) dynamic system

$$x_{k+1} = F x_k + B w_k, \qquad (4)$$

where $x_k \in \mathbb{R}^n$ is the state of interest. $F \in \mathbb{R}^{n \times n}$ is the
state transition matrix. $B$ is a full rank matrix with appropriate
dimensions that models the process noise $w_k$, which is assumed to be
white Gaussian, i.e., $w_k \sim \mathcal{N}(0, Q)$. Here $Q$ is the
covariance of the process noise and it is assumed that $Q \succ 0$.
Moreover, it is assumed that = 0, VA:' G Z* and 

k>k'. 

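Suppose that each agent $i$, $i \in \mathcal{V}$, is able to obtain a local
measurement $z_{i,k} \in \mathbb{R}^{m_i}$, which follows an LTI sensing model:

$$z_{i,k} = H_i x_k + v_{i,k}, \qquad (5)$$

where $H_i \in \mathbb{R}^{m_i \times n}$ is the local observation matrix, and $v_{i,k}$ is the
local measurement noise, assumed to be white Gaussian,
i.e., $v_{i,k} \sim \mathcal{N}(0, R_i)$ with $R_i \in \mathbb{R}^{m_i \times m_i}$ and $R_i \succ 0$ being the
covariance of the measurement noise.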

In large-scale sensor networks, it is common to encounter
situations where the target of interest is not directly observed
by a subset of local agents. The following assumption is
made throughout this paper.

Assumption 1: If the target of interest is not observed by
agent $i$ at time instant $k$, then $R_i^{-1} = 0$ and $z_{i,k}$ is arbitrary.

Assumption 1 essentially states that an agent observing no
target has infinite uncertainty about its measurement. As $z_{i,k}$
can be arbitrary for such an agent, one can regard either $H_i$
or $v_{i,k}$ as arbitrary. From this point, to simplify the statement
of the observability conditions, it is assumed that $H_i = 0$ if
the target is not observed by agent $i$ at time $k$. The naive
agent can therefore be defined as follows.

Definition 2 (Naive agent): Let $\mathcal{V}_n \subseteq \mathcal{V}$ be the set of vertices
corresponding to all naive agents in the network. Then
$i \in \mathcal{V}_n$ if $(F, \mathrm{col}\{H_j\}_{j \in \mathcal{J}_i})$ is not an observable pair.

In other words, an agent is naive as long as there exists 
one state component which cannot be recovered from the 
collective measurements in its inclusive neighborhood. In 
such a case, the agent is actually only naive with respect to 
this state component. In the rest of this paper, for simplicity, 
we focus on naive agents that can recover none of the state 
components. In the case where naivety is discussed with 
respect to some state components, the same conclusions hold 
for the corresponding subspaces of the state space. 

The objective is formulated as follows. Each agent is
equipped with a local sensor formulated by (5). Each agent
makes use of both the local information sources, and the
information obtained from its neighbors through communication,
to come up with a consistent estimate of the dynamic
state in (4) with as much confidence as possible. The
estimation process should not depend on any global information.


IV. Distributed Estimation with Hybrid
Information Fusion Strategy


Suppose that at time instant $k$, each agent is able to take
its local measurement $z_{i,k}$. Besides $z_{i,k}$, each agent has a
consistent prior estimate of the state of interest, denoted as
$\hat{x}_{i,k|k-1}$. The local prior estimate error and approximated error
covariance are defined as, respectively, $\tilde{\eta}_{i,k|k-1} \triangleq \hat{x}_{i,k|k-1} - x_k$
and $P_{i,k|k-1}$, where $P_{i,k|k-1} \succeq \tilde{P}_{i,k|k-1}$ with $\tilde{P}_{i,k|k-1} = \mathrm{E}[\tilde{\eta}_{i,k|k-1}\tilde{\eta}_{i,k|k-1}^T]$ being
the true error covariance of $\hat{x}_{i,k|k-1}$. The following assumptions
are made throughout this paper.

Assumption 2: For each $k \in \mathbb{Z}^+$: (2.1) All local measurement
noises are mutually uncorrelated, i.e., $\mathrm{E}[v_{i,k} v_{j,k}^T] =
0$, $\forall i \neq j$; (2.2) Any local prior estimate error is uncorrelated
with any local measurement noise, i.e., $\mathrm{E}[\tilde{\eta}_{i,k|k-1} v_{j,k}^T] = 0$, $\forall i, j$.

A. Distributed Hybrid Information Fusion (DHIF) 

Now we discuss the strategy to fuse all information
sources available to agent $i$ at $k \in \mathbb{Z}^+$. The proposed strategy
is a hybrid of two fusion steps.

Step 1. Fuse the prior estimates obtained from all agents
in the inclusive neighborhood, i.e., $\{\hat{x}_{j,k|k-1}\}_{j \in \mathcal{J}_i}$, and obtain
an intermediate estimate and its approximated error
covariance, denoted as, respectively, $\check{x}_{i,k}$ and $\check{P}_{i,k}$. Since the
correlations between each pair of the prior estimate errors in
the inclusive neighborhood are neither known nor negligible,
the CI algorithm [2] is used for this step. Let $I_n$, $P_{j,k|k-1}$,
$\hat{x}_{j,k|k-1}$, $\forall j \in \mathcal{J}_i$, $\check{P}_{i,k}$ and $\check{x}_{i,k}$ play, respectively, the roles of
$C_i$, $R_{a_i}$, $a_i$, $\forall i \in \{1,\dots,p\}$, $R_c$ and $\hat{a}_c$ in (3). It follows that

$$\check{P}_{i,k}^{-1} = \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1}, \qquad \check{x}_{i,k} = \check{P}_{i,k} \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1} \hat{x}_{j,k|k-1},$$

where $\omega_{ij}^{(k)}$ is the weight that agent $i$ assigns to the
information received from agent $j$ at time instant $k$. The
selection of $\omega_{ij}^{(k)}$ will be discussed in detail in Section IV-B.
Note that the CI algorithm requires that $\sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} = 1$ for
any given $i$ and $k$.


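Step 2. Fuse the intermediate estimate obtained from Step
1 (i.e., $\check{x}_{i,k}$) with all measurements obtained from the inclusive
neighborhood (i.e., $\{z_{j,k}\}_{j \in \mathcal{J}_i}$), and obtain the posterior
state estimate and its approximated error covariance. The
conclusion of the following lemma is necessary for this step.

Lemma 2: Under Assumptions (2.1) and (2.2), $\{v_{j,k}\}_{j \in \mathcal{J}_i}$
and $(\check{x}_{i,k} - x_k)$ are mutually uncorrelated, $\forall i \in \mathcal{V}$.

Proof: With Assumption (2.1) satisfied, it suffices to
show that for each $l \in \mathcal{J}_i$, $\mathrm{E}[(\check{x}_{i,k} - x_k) v_{l,k}^T] = 0$. This can be
verified as

$$\mathrm{E}[(\check{x}_{i,k} - x_k) v_{l,k}^T] = \mathrm{E}\Big[\Big(\check{P}_{i,k} \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1} \hat{x}_{j,k|k-1} - \check{P}_{i,k}\check{P}_{i,k}^{-1} x_k\Big) v_{l,k}^T\Big] = \check{P}_{i,k} \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1} \mathrm{E}[\tilde{\eta}_{j,k|k-1} v_{l,k}^T] = 0,$$

where Assumption (2.2) is used for the last equality. ■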

As shown by Lemma 2, all $|\mathcal{J}_i| + 1$ information sources
to fuse have mutually uncorrelated errors. Therefore, Eq. (2)
can be applied. Without loss of generality, let $H_j$, $R_j$ and
$z_{j,k}$, $\forall j \in \mathcal{J}_i$ play, respectively, the roles of $C_i$, $R_{a_i}$ and $a_i$, $\forall i \in
\{1,\dots,p-1\}$ in (2), and let $I_n$, $\check{P}_{i,k}$, $\check{x}_{i,k}$, $P_{i,k|k}$ and $\hat{x}_{i,k|k}$ play,
respectively, the roles of $C_p$, $R_{a_p}$, $a_p$, $R_{\hat{a}}^*$ and $\hat{a}^*$ in (2). With
$\check{P}_{i,k}$ and $\check{x}_{i,k}$ obtained from the previous step, it follows that

$$P_{i,k|k} = \Big(\sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1} + \sum_{j \in \mathcal{J}_i} H_j^T R_j^{-1} H_j\Big)^{-1},$$
$$\hat{x}_{i,k|k} = P_{i,k|k}\Big(\sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1} \hat{x}_{j,k|k-1} + \sum_{j \in \mathcal{J}_i} H_j^T R_j^{-1} z_{j,k}\Big). \qquad (6)$$

We refer to (6) as the update steps of the proposed algorithm,
named Distributed Hybrid Information Fusion (DHIF),
implemented by agent $i$ at time instant $k$. Here $\hat{x}_{i,k|k}$ and
$P_{i,k|k}$ in (6) are, respectively, the local posterior estimate
and the approximated local posterior error covariance by
the DHIF. Define the posterior estimate error and the true
posterior error covariance as, respectively, $\tilde{\eta}_{i,k|k} \triangleq \hat{x}_{i,k|k} - x_k$
and $\tilde{P}_{i,k|k} \triangleq \mathrm{E}[\tilde{\eta}_{i,k|k}\tilde{\eta}_{i,k|k}^T]$. The consistency is preserved in this
2-step fusion process, as stated in the following lemma.

Lemma 3: Under Assumptions (2.1) and (2.2), the update
steps (6) preserve the consistency, i.e., $P_{i,k|k} \succeq \tilde{P}_{i,k|k}$
if $\{z_{j,k}\}_{j \in \mathcal{J}_i}$ and $\{\hat{x}_{j,k|k-1}\}_{j \in \mathcal{J}_i}$ are all consistent information
sources.

Proof: The fact that $\hat{x}_{i,k|k}$ given by (6) preserves
consistency can be directly observed because both Step 1 and
Step 2 are fusion strategies that preserve the consistency. ■

B. Weights Selection for CI Algorithm

Because the CI algorithm is used in Step 1, the update
steps in (6) only give a suboptimal estimate. As naive agents
could come up with estimates with very low confidence,
if relatively high weights are assigned to such estimates,
the intermediate estimate obtained by Step 1 will become
less confident. Therefore, the weights for
the embedded CI algorithm in Step 1 should be selected
carefully. This is especially important in the scenario where
the agents are allowed to communicate only once with
their neighbors between updates, as the advantages of asymptotic
properties brought by the consensus-based algorithms are lost
in such a scenario. The optimal set of $\{\omega_{ij}^{(k)}\}_{j \in \mathcal{J}_i}$ can be
obtained by minimizing $\mathrm{tr}\big((\sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1})^{-1}\big)$, which can
be cast as the following Semi-definite Programming (SDP)
problem [12]:

$$\begin{aligned}
\underset{u,\ \omega_{ij}^{(k)}}{\text{minimize}} \quad & u^T \mathbf{1} \\
\text{subject to} \quad & \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} = 1;\quad 0 < \underline{\omega}_i \leq \omega_{ij}^{(k)} \leq 1,\ \forall j \in \mathcal{J}_i, \\
& \begin{bmatrix} \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1} & e_l \\ e_l^T & u_l \end{bmatrix} \succeq 0,\quad l = 1,\dots,n,
\end{aligned} \qquad (7)$$

where $\underline{\omega}_i$, $\forall i$ are sufficiently small constant lower bounds for
selecting weights. Here $e_l \in \mathbb{R}^n$ is the canonical basis vector
whose $l$th entry is one, and $u \in \mathbb{R}^n$ with $l$th entry $u_l$. Due to the convexity
of the problem formulated in (7), the set of weights can be
determined efficiently. In the case where solving (7) is still
considered computationally expensive, some suboptimal
approximations can still be used [13].
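To make the objective of the weight selection concrete, the sketch below replaces the SDP by a brute-force grid search over a single weight for two 2-by-2 prior covariances. This is not the SDP of [12]; the grid search, the helper names, the lower bound 0.01 and the two covariances are assumptions made only for illustration.

```python
def inv2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def ci_trace(w, p1, p2):
    """Trace of the CI-fused covariance (w * P1^{-1} + (1 - w) * P2^{-1})^{-1}."""
    i1, i2 = inv2(p1), inv2(p2)
    info = [[w * i1[r][c] + (1 - w) * i2[r][c] for c in range(2)] for r in range(2)]
    fused = inv2(info)
    return fused[0][0] + fused[1][1]

def select_weight(p1, p2, lower=0.01, grid=100):
    """Pick the grid point in [lower, 1 - lower] that minimizes the fused trace."""
    candidates = [k / grid for k in range(grid + 1)]
    feasible = [w for w in candidates if lower <= w <= 1 - lower]
    return min(feasible, key=lambda w: ci_trace(w, p1, p2))

# Hypothetical priors: each is confident in a different state component.
P1 = [[1.0, 0.0], [0.0, 4.0]]
P2 = [[4.0, 0.0], [0.0, 1.0]]
best_w = select_weight(P1, P2)
best_trace = ci_trace(best_w, P1, P2)
```

For these symmetric priors the search returns a weight of 0.5 with fused trace 3.2, whereas putting all the weight on either prior alone would give trace 5.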


C. Recursive Form 


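Algorithm 1: DHIF Implemented by Agent i at Time k

if $k = 1$ then
    initializes $\hat{x}_{i,1|0}$ and $P_{i,1|0}$
computes $\Xi_{i,k} = P_{i,k|k-1}^{-1}$ and $\xi_{i,k} = P_{i,k|k-1}^{-1}\hat{x}_{i,k|k-1}$
takes the local measurement $z_{i,k}$
computes $S_{i,k} = H_i^T R_i^{-1} H_i$ and $y_{i,k} = H_i^T R_i^{-1} z_{i,k}$
sends $S_{i,k}$, $y_{i,k}$, $\Xi_{i,k}$ and $\xi_{i,k}$ to agent $j$, $\forall j$ such that $i \in \mathcal{N}_j$
receives $S_{j,k}$, $y_{j,k}$, $\Xi_{j,k}$ and $\xi_{j,k}$ from agent $j$, $\forall j \in \mathcal{N}_i$
selects the set of weights $\{\omega_{ij}^{(k)}\}_{j \in \mathcal{J}_i}$
computes $\check{\Xi}_{i,k} = \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} \Xi_{j,k}$ and $\check{\xi}_{i,k} = \sum_{j \in \mathcal{J}_i} \omega_{ij}^{(k)} \xi_{j,k}$
computes $\bar{y}_{i,k} = \sum_{j \in \mathcal{J}_i} y_{j,k}$ and $\bar{S}_{i,k} = \sum_{j \in \mathcal{J}_i} S_{j,k}$
updates local estimate and approximated covariance
    $P_{i,k|k} = (\bar{S}_{i,k} + \check{\Xi}_{i,k})^{-1}$    (8)
    $\hat{x}_{i,k|k} = P_{i,k|k}(\bar{y}_{i,k} + \check{\xi}_{i,k})$    (9)
predicts local estimate and approximated covariance
    $P_{i,k+1|k} = F P_{i,k|k} F^T + B Q B^T$    (10)
    $\hat{x}_{i,k+1|k} = F \hat{x}_{i,k|k}$    (11)

With the prediction steps formulated based on the system
dynamics in (4), the recursive form of the distributed hybrid
information fusion (DHIF) strategy implemented by agent $i$
at time $k$ is summarized in Algorithm 1.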
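A scalar sketch of the recursion in Algorithm 1 is given below for a hypothetical three-agent path network in which only agent 0 observes the state, so agent 2 is naive. The system parameters, the initial values and the weight rule (a lower bound for every neighbor, the remaining weight on the most informative prior) are assumptions for illustration; the weight rule stands in for the SDP in (7).

```python
import random

F, B, Q = 1.0, 1.0, 0.1       # assumed scalar system parameters
H = [1.0, 0.0, 0.0]           # only agent 0 observes the state
R_INV = [2.0, 0.0, 0.0]       # R_i^{-1} = 0 for non-observing agents (Assumption 1)
J = {0: [0, 1], 1: [0, 1, 2], 2: [1, 2]}  # inclusive neighborhoods, path 0 - 1 - 2
W_MIN = 0.05                  # hypothetical lower bound on each CI weight

def ci_weights(info_list):
    """Lower bound for everyone, remaining weight on the most informative prior."""
    w = [W_MIN] * len(info_list)
    best = max(range(len(info_list)), key=lambda n: info_list[n])
    w[best] += 1.0 - W_MIN * len(info_list)
    return w

def dhif_step(x_prior, p_prior, z):
    Xi = [1.0 / p for p in p_prior]                  # prior information
    xi = [x / p for x, p in zip(x_prior, p_prior)]   # prior information vector
    S = [h * r * h for h, r in zip(H, R_INV)]        # measurement information
    y = [h * r * zi for h, r, zi in zip(H, R_INV, z)]
    x_next, p_next = [], []
    for i in range(3):
        nb = J[i]
        w = ci_weights([Xi[j] for j in nb])
        Xi_c = sum(wj * Xi[j] for wj, j in zip(w, nb))
        xi_c = sum(wj * xi[j] for wj, j in zip(w, nb))
        p_post = 1.0 / (sum(S[j] for j in nb) + Xi_c)      # update (8)
        x_post = p_post * (sum(y[j] for j in nb) + xi_c)   # update (9)
        p_next.append(F * p_post * F + B * Q * B)          # prediction (10)
        x_next.append(F * x_post)                          # prediction (11)
    return x_next, p_next

rng = random.Random(0)
x_true = 5.0
x_hat, p_hat = [0.0] * 3, [100.0] * 3
for _ in range(50):
    # Non-observing agents report an arbitrary measurement; it carries zero information.
    z = [x_true + rng.gauss(0.0, 0.5 ** 0.5), 0.0, 0.0]
    x_hat, p_hat = dhif_step(x_hat, p_hat, z)
    x_true = F * x_true + B * rng.gauss(0.0, Q ** 0.5)
```

In this run the approximated prior variances of all three agents, including the naive agent 2, settle well below the initial value of 100, because confident priors propagate one hop per time step through the CI weights.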

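Theorem 1: Under Assumption 2, Algorithm 1 preserves
the consistency. That is, $P_{i,k|k-1} \succeq \tilde{P}_{i,k|k-1}$ and $P_{i,k|k} \succeq \tilde{P}_{i,k|k}$,
$\forall k \in \mathbb{Z}^+$, if the initialized local estimates are
consistent. That is,

$$P_{i,1|0} \succeq \mathrm{E}[(\hat{x}_{i,1|0} - x_1)(\hat{x}_{i,1|0} - x_1)^T], \quad \forall i. \qquad (12)$$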







Proof: Lemma 3 has shown that the update steps in
(6) preserve the consistency of the information. Therefore,
it suffices to show that if $P_{i,k|k} \succeq \tilde{P}_{i,k|k}$, one can obtain $P_{i,k+1|k} \succeq
\tilde{P}_{i,k+1|k}$. Note that $\tilde{P}_{i,k+1|k} = F\tilde{P}_{i,k|k}F^T + BQB^T + \Omega_1 + \Omega_2$,
where $\Omega_1 = -F\,\mathrm{E}[\tilde{\eta}_{i,k|k} w_k^T]B^T$ and $\Omega_2 = \Omega_1^T$. Since $\tilde{\eta}_{i,k|k}$
is a linear combination of the initial state, the process noises prior
to time $k$ and the measurement noises up to time $k$, each of
which is assumed to be uncorrelated with $w_k$, it follows
that $\mathrm{E}[\tilde{\eta}_{i,k|k} w_k^T] = 0$. Therefore, $\Omega_2 = \Omega_1^T = 0$. It follows
from (10) that if $P_{i,k|k} \succeq \tilde{P}_{i,k|k}$, $P_{i,k+1|k} = FP_{i,k|k}F^T + BQB^T \succeq
F\tilde{P}_{i,k|k}F^T + BQB^T = \tilde{P}_{i,k+1|k}$. Therefore, the prediction steps
in Algorithm 1 also preserve the consistency. It follows
that any estimate at any agent obtained by the recursive
Algorithm 1 is consistent, if all local estimates initially fed
to the algorithm are consistent, as formulated in (12). ■

Remark 1: It is worth mentioning that (12) can be easily
satisfied in general. The prior knowledge about the state of
interest can be learned in an off-line manner before the fusion
process. In the worst case, each agent can simply choose
$P_{i,1|0}^{-1} = 0$, which indicates infinite initial local uncertainty,
so that (12) is satisfied.

V. Comparisons with Existing Algorithms 
In this section, we analyze the advantages of the proposed 
DHIF algorithm, and compare it with some existing algo¬ 
rithms in the literature. 

A. Robustness in Presence of Naive Agents 
Note that (9) can be written as

$$\hat{x}_{i,k|k} = \hat{x}_{i,k|k-1} + P_{i,k|k}(\bar{y}_{i,k} - \bar{S}_{i,k}\hat{x}_{i,k|k-1}) + P_{i,k|k} \sum_{j \in \mathcal{N}_i} \omega_{ij}^{(k)} P_{j,k|k-1}^{-1}(\hat{x}_{j,k|k-1} - \hat{x}_{i,k|k-1}). \qquad (13)$$

As observed in (13), the approximated posterior estimate
error covariance $P_{i,k|k}$, which indicates the uncertainty of agent
$i$'s local estimate, multiplies both of the last two terms.
On one hand, if agent $i$ is more confident about its own
local estimate, $P_{i,k|k}$ will be small. In such a situation, the
effects on $\hat{x}_{i,k|k}$ caused by the last two terms in (13) will be
attenuated by a small $P_{i,k|k}$. On the other hand, when $P_{i,k|k}$
is large, (13) will push the local posterior estimate to match
its neighbors' measurements and their local prior estimates,
as described from the following two perspectives.

(1) Note that $\bar{y}_{i,k} - \bar{S}_{i,k}\hat{x}_{i,k|k-1} = \sum_{j \in \mathcal{J}_i} H_j^T R_j^{-1}(z_{j,k} -
H_j \hat{x}_{i,k|k-1})$, which can be regarded as a weighted sum of
the innovation terms that push $\hat{x}_{i,k|k}$ to match $z_{j,k}$, $\forall j \in \mathcal{J}_i$.
If a neighboring agent $j$ does not directly observe the state
of interest, $R_j^{-1} = 0$. Thus, its bad effects on the update of
$\hat{x}_{i,k|k}$, caused by the erroneous local measurement $z_{j,k}$, will
be eliminated in an automated manner.

(2) Similarly, the last term in (13) can also be regarded
as a weighted sum, which pushes $\hat{x}_{i,k|k}$ to match $\hat{x}_{j,k|k-1}$,
$\forall j \in \mathcal{J}_i$. The weight that agent $i$ assigns to agent $j$ is the
product of $\omega_{ij}^{(k)}$ and $P_{j,k|k-1}^{-1}$. The former component
is selected to increase the confidence of the estimate while
preserving its consistency; the latter one, which indicates the
confidence of each neighbor's local prior estimate, pushes the
local estimate toward its neighbors' with higher confidence.


Let $\delta$ be a scalar parameter. If the last term in (13) is
replaced with $\delta P_{i,k|k} \sum_{j \in \mathcal{J}_i}(\hat{x}_{j,k|k-1} - \hat{x}_{i,k|k-1})$, the update step
of the local posterior estimates proposed by the KCF is
obtained. The KCF [7] has been shown to perform well with
guaranteed convergence under certain conditions. However,
it should be noted that the KCF relies on the assumption
that the state of interest is jointly observable in the inclusive
neighborhood of every agent. The divergence of the KCF
in the presence of naive agents has been shown in [8] and
[9]. Essentially, with the last term of (13) replaced with
$\delta P_{i,k|k} \sum_{j \in \mathcal{J}_i}(\hat{x}_{j,k|k-1} - \hat{x}_{i,k|k-1})$ in the KCF, each agent equally
weighs its neighbors' prior estimates regardless of the
confidence of these estimates. The performance therefore
deteriorates, especially when there exist some other naive
agents in its neighborhood. More detailed analysis on the
performance of the KCF in the presence of naive agents can be
found in [8].

B. Fully Distributed 

In the GKCF [8] and ICF [9], the rate parameter $\epsilon$ is
required to be chosen between 0 and $1/\Delta_{\max}$ to guarantee the
performance of the embedded average consensus algorithm.
If the maximum in-degree is changing with time, a proper
selection of $\epsilon$ might not be as good as before. Even if
$\Delta_{\max}$ is known, it is not clear how to select a good $\epsilon$
in the studies of the GKCF and ICF. In [8], [9], after
obtaining a new local measurement, each agent is allowed
to communicate with its local neighbors infinitely many times,
before it finally updates its posterior local estimate at the
current time instant. Therefore, in such a case, any selection
between 0 and $1/\Delta_{\max}$ will guarantee that every agent's
local estimate asymptotically becomes identical before its
next measurement update. Unfortunately, in the realistic
case where every agent communicates only once with its
neighbors before updating its local estimate, the importance
of selecting $\epsilon$ is similar to the selection of weights in the
proposed DHIF, as discussed in Section IV-B.

Besides $\Delta_{\max}$, the ICF also requires each agent to know
the total number of agents $N$ in the network to asymptotically
approach the centralized solution via infinite communication
steps before the agents update their local estimates. Similar
to the discussion on $\Delta_{\max}$, $N$ could be changing over time.
Moreover, in the case focused on in this paper (a single
communication step), the ICF is not able to obtain the optimal solution
even with the correct knowledge of $N$.

The proposed DHIF algorithm does not require any global 
information. It is run in an automated manner and is adaptive 
to the locally unknown changes in the network. 

C. Consistency of Estimates 

The consistency is one of the most fundamental but 
significant properties to be preserved during the information 
fusion process. That is, the approximated error covariance 
should be lower bounded by the true error covariance. The 
approximated error covariance of an inconsistent estimator is 
over-confident, and hence cannot indicate the uncertainty of 
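the estimate. Let the counterparts of $P_{i,k|k}$ and $P_{i,k|k-1}$ obtained
by the ICF be, respectively, $P^{\mathrm{ICF}}_{i,k|k}$ and $P^{\mathrm{ICF}}_{i,k|k-1}$. In the case
where the agents communicate only once before updating
their local estimates, the update steps of the ICF proposed
in [9] can be written as

$$P^{\mathrm{ICF}}_{i,k|k} = \Big[\sum_{j \in \mathcal{J}_i} \sigma_{ij}\big(P^{\mathrm{ICF}}_{j,k|k-1}\big)^{-1} + N \sum_{j \in \mathcal{J}_i} \sigma_{ij} H_j^T R_j^{-1} H_j\Big]^{-1}, \qquad (14)$$

where $\sigma_{ij}$, $\forall j$ are the weights related to the rate parameter
$\epsilon$. Therefore, it is very possible that

$$N \sum_{j \in \mathcal{J}_i} \sigma_{ij} H_j^T R_j^{-1} H_j \succeq \sum_{j \in \mathcal{J}_i} H_j^T R_j^{-1} H_j. \qquad (15)$$

Note that the right-hand side of the above equation is the total
information contained in $\{z_{j,k}\}_{j \in \mathcal{J}_i}$. Therefore, it is possible
that the $P^{\mathrm{ICF}}_{i,k|k}$ obtained from (14) is smaller than the true error
covariance of the local posterior estimates, especially when
$N \gg |\mathcal{J}_i|$, which is usually the case in sparse wireless sensor
networks.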

D. Confidence of Estimates 

While the consistency is guaranteed, one would also like to come
up with an estimate that is as confident as possible. In the
case where the agents communicate only once before local
updates, the update step of the approximated error covariance
obtained by the KLA algorithm [10], denoted as $P^{\mathrm{KLA}}_{i,k|k}$, has
the following form:

$$P^{\mathrm{KLA}}_{i,k|k} = \Big\{\sum_{j \in \mathcal{J}_i} \sigma_{ij}\Big[\big(P^{\mathrm{KLA}}_{j,k|k-1}\big)^{-1} + H_j^T R_j^{-1} H_j\Big]\Big\}^{-1}, \qquad (16)$$

where $\sigma_{ij} = \{E\}_{(i,j)}$ with $E$ being a stochastic matrix
associated with the communication graph. In general, given the
same set of prior estimates and local measurements from the
inclusive neighborhood, the local posterior estimate obtained
by (6) is more confident than that obtained by (16). This is
mainly due to the following two reasons.

(1) The KLA algorithm did not specify the selection of
the weights $\sigma_{ij}$, $\forall j$. The only constraint formulated in [10]
is $E$ being a primitive stochastic matrix. When only one
communication step is feasible, the weights really matter in
minimizing the fused approximated error covariance.

(2) More importantly, even if we suppose that the same
set of weights, say $\sigma_{ij}$, $\forall j \in \mathcal{J}_i$, are selected for both fusion
processes in (6) and (16), the update steps of the proposed
DHIF are still guaranteed to give more confident estimates
since $\sum_{j \in \mathcal{J}_i} H_j^T R_j^{-1} H_j \succeq \sum_{j \in \mathcal{J}_i} \sigma_{ij} H_j^T R_j^{-1} H_j$, which in turn,
implies that $P^{\mathrm{KLA}}_{i,k|k} \succeq P_{i,k|k}$.
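The second reason can be checked numerically in the scalar case. The sketch below evaluates the DHIF covariance update and the single-step KLA covariance update for the same priors, measurement information and weights; all numbers are hypothetical.

```python
def dhif_cov(priors, meas_info, weights):
    """DHIF update: CI-weighted prior information plus the full measurement information."""
    prior_info = sum(w / p for w, p in zip(weights, priors))
    return 1.0 / (prior_info + sum(meas_info))

def kla_cov(priors, meas_info, weights):
    """Single-step KLA update: the weights also scale the measurement information."""
    return 1.0 / sum(w * (1.0 / p + s) for w, p, s in zip(weights, priors, meas_info))

# Hypothetical inclusive neighborhood with three agents (scalar state).
priors = [2.0, 1.0, 4.0]       # approximated prior variances
meas_info = [1.0, 0.0, 2.0]    # H_j^T R_j^{-1} H_j; the second agent does not observe
weights = [0.5, 0.3, 0.2]      # same weights used in both updates

p_dhif = dhif_cov(priors, meas_info, weights)
p_kla = kla_cov(priors, meas_info, weights)
```

Here the DHIF variance is about 0.278 while the KLA variance is about 0.667, since the weights, each at most one, discount the measurement information only in the KLA update.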

Superficially, (6) might look similar to (16). However, the
philosophies behind the proposed DHIF and the KLA are
very different. Compared to the proposed DHIF, the update
steps of the KLA can also be regarded as a two-step fusion
process. The first step is that, for each $i \in \mathcal{V}$, fuse $z_{i,k}$ and
$\hat{x}^{\mathrm{KLA}}_{i,k|k-1}$ for an intermediate estimate $\check{x}^{\mathrm{KLA}}_{i,k}$ with $\check{P}^{\mathrm{KLA}}_{i,k}$ being
its approximated error covariance. The fusion strategy (2)
is adopted as $z_{i,k}$ and $\hat{x}^{\mathrm{KLA}}_{i,k|k-1}$ have uncorrelated errors. The
second step is to fuse $\{\check{x}^{\mathrm{KLA}}_{j,k}\}_{j \in \mathcal{J}_i}$. Note that the information
sources in this set have unknown correlations in general due
to the fact that $\{\hat{x}^{\mathrm{KLA}}_{j,k|k-1}\}_{j \in \mathcal{J}_i}$ become highly correlated as
the fusion process goes on. Therefore, the consensus on the
PDFs, which is equivalent to the CI algorithm in the case
therein (see [10] for details), is used to obtain the posterior
estimate and approximated error covariance. Note that during
the overall fusion process of the KLA algorithm, the fact that
all local measurement noises are mutually uncorrelated is not
utilized. By fusing $\hat{x}^{\mathrm{KLA}}_{i,k|k-1}$ and $z_{i,k}$ first, $z_{i,k}$ is treated as
correlated with $z_{j,k}$, $j \in \mathcal{N}_i$, with unknown correlations. This
makes the final estimate relatively conservative.


VI. Boundedness of the Estimate Error 


In this section, the sufficient condition for each local
estimate error to be upper bounded is formulated. Specifically,
we assume the communication topology and the set of naive
agents are both fixed. The scenario in which they are
time-varying is left for future work. Due to the similarities
in the structures of the DHIF and KLA algorithms,
we borrow the framework of the proofs in [10], and add our
extensions. The following lemma will be used in the proof
of the main result.

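Lemma 4: Let $D$ be the stochastic matrix associated with
$\mathcal{G}(\mathcal{V},\mathcal{E})$. Let $\mathcal{L}$ and $\mathcal{F}$ be two exclusive sets of vertices
such that $\mathcal{L} \cup \mathcal{F} = \mathcal{V}$. Suppose that $\mathcal{G}_{\mathcal{L}}(\mathcal{L},\mathcal{E}_{\mathcal{L}})$ is strongly
connected, where $\mathcal{E}_{\mathcal{L}} = \{(i,j) \in \mathcal{E} \mid i,j \in \mathcal{L}\}$. Also
suppose that there exists a directed path from $\mathcal{L}$ to every
agent $j \in \mathcal{F}$ but there is no edge from any agent in $\mathcal{F}$ to
$\mathcal{L}$. Then there exists a $\bar{k}$ such that $\{D^k\}_{(j,i)} > 0$, $\forall j \in \mathcal{V}$
and $i \in \mathcal{L}$, if $k \geq \bar{k}$. Moreover, the same conclusion holds
for $D_k$, where $D_k$ is the multiplication of $k$ matrices
which belong to the stochastic matrices family induced by
$D$.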

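Proof: Without loss of generality, let $\mathcal{M} = \{1, \cdots, |\mathcal{M}|\}$
and $\bar{\mathcal{M}} = \{|\mathcal{M}|+1, \cdots, N\}$. Let $D_{\mathcal{M}}$ be the stochastic
matrix associated with the strongly connected graph
$\mathcal{G}_{\mathcal{M}}$. Thus $D_{\mathcal{M}}$ is primitive. Because there is no edge
from $\bar{\mathcal{M}}$ to $\mathcal{M}$, $D$ can be written as a block lower triangular
matrix

$$D = \begin{bmatrix} D_{\mathcal{M}} & 0 \\ D_{\bar{\mathcal{M}}\mathcal{M}} & D_{\bar{\mathcal{M}}} \end{bmatrix},$$

where $D_{\bar{\mathcal{M}}\mathcal{M}}$ and $D_{\bar{\mathcal{M}}}$ are matrices with appropriate
dimensions. It follows that

$$D^k = \begin{bmatrix} D_{\mathcal{M}}^k & 0 \\ D_{\bar{\mathcal{M}}\mathcal{M}}^{(k)} & D_{\bar{\mathcal{M}}}^k \end{bmatrix},$$

where

$$D_{\bar{\mathcal{M}}\mathcal{M}}^{(k)} = \sum_{r=0}^{k-1} D_{\bar{\mathcal{M}}}^{r} D_{\bar{\mathcal{M}}\mathcal{M}} D_{\mathcal{M}}^{k-1-r}.$$

As $D_{\mathcal{M}}$ is primitive, $D_{\mathcal{M}}^k > 0$ for any sufficiently large $k$. Thus
it remains to show that $D_{\bar{\mathcal{M}}\mathcal{M}}^{(k)} > 0$ for any sufficiently large
$k$. This can be shown by induction.

Suppose that when $k = k'$, $D_{\mathcal{M}}^k > 0$. Also suppose that agent
$i_1 \in \bar{\mathcal{M}}$ has a direct edge from $\mathcal{M}$. Thus, $\{D\}_{(i_1, j \in \mathcal{M})}$ contains
at least one positive entry. It follows that

$$\{D^{k'+1}\}_{(i_1, j \in \mathcal{M})} = \{D\}_{(i_1, \cdot)} \{D^{k'}\}_{(\cdot, j \in \mathcal{M})} \ge \{D\}_{(i_1, j \in \mathcal{M})} D_{\mathcal{M}}^{k'},$$

where the last inequality is due to the fact that $D \ge 0$. As
$\{D\}_{(i_1, j \in \mathcal{M})}$ contains at least one positive entry and $D_{\mathcal{M}}^{k'} > 0$,
it follows that $\{D^{k'+1}\}_{(i_1, j \in \mathcal{M})} > 0$. Moreover, it follows that
$D_{\mathcal{M}}^k \to \mathbf{1} v_1^{\top}$ as $k \to \infty$, where $v_1 > 0$ is the left eigenvector
associated with the simple eigenvalue 1 of $D_{\mathcal{M}}$ satisfying
$\|v_1\|_1 = 1$. That is, each entry in $\{D^k\}_{(i_1, j \in \mathcal{M})}$ is uniformly
lower bounded above zero, for any $k > k'$. Let $i_2 \in \bar{\mathcal{M}}$ be a child
vertex of $i_1$. Thus, $\{D\}_{(i_2, i_1)} > 0$. Since $\{D^{k'+1}\}_{(i_1, j \in \mathcal{M})} > 0$
as previously shown, it follows that

$$\{D^{k'+2}\}_{(i_2, j \in \mathcal{M})} \ge \{D\}_{(i_2, i_1)} \{D^{k'+1}\}_{(i_1, j \in \mathcal{M})} > 0.$$

By the similar approach, it can also be shown that
$\{D^k\}_{(i_2, j \in \mathcal{M})} > 0$ for any $k > k' + 1$. Therefore, it can be
eventually shown that for any agent $i$ that has a directed path
from some leader component $\mathcal{M}$, there exists a sufficiently
large $\bar{k}$ such that, for any $k \ge \bar{k}$, $\{D^k\}_{(i,j)} > 0$, $\forall j \in \mathcal{M}$.

As each corresponding entry of all matrices, which belong
to the stochastic matrix family induced by the same matrix
$D$, has the same type (zero/nonzero), the same conclusion
holds for any product of $k$ such matrices. ∎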
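
The statement of Lemma 4 can be checked numerically on a small example. The sketch below is only an illustration under assumed data: the 4-agent matrix `D`, the leader set `{0, 1}`, and the helper names are hypothetical and do not come from the paper. Agents 0 and 1 form a strongly connected leader component with self-loops, agent 2 listens to agent 1, agent 3 listens to agent 2, and no leader listens to a follower, so `D` is block lower triangular.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][r] * b[r][j] for r in range(n)) for j in range(n)]
            for i in range(n)]


def first_power_with_positive_leader_columns(d, leaders, max_power=50):
    """Smallest k such that every row of d**k is positive in the leader columns.

    Returns None if no such k is found up to max_power.
    """
    power = d  # holds d**k at the start of iteration k
    for k in range(1, max_power + 1):
        if all(power[i][j] > 0 for i in range(len(d)) for j in leaders):
            return k
        power = matmul(power, d)
    return None


# Hypothetical row-stochastic matrix; row i holds the weights agent i
# assigns to the agents it listens to (itself included).
D = [
    [0.5, 0.5, 0.0, 0.0],  # leader 0 listens to leaders 0 and 1
    [0.5, 0.5, 0.0, 0.0],  # leader 1 listens to leaders 0 and 1
    [0.0, 0.5, 0.5, 0.0],  # follower 2 listens to leader 1 and itself
    [0.0, 0.0, 0.5, 0.5],  # follower 3 listens to follower 2 and itself
]

k_bar = first_power_with_positive_leader_columns(D, leaders=[0, 1])  # 3
```

In this example the leader columns become entirely positive at the third power, which matches the length of the longest path from the leader component to the last follower. The zero block in the upper right corner is preserved for every power, as the block lower triangular form in the proof requires.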

The following lemma is also used for the proof later on. 
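Lemma 5 ([10]): Let $F$ be a nonsingular matrix. Then
for any $Y \succeq 0$ and $\bar{\Omega} \succ 0$, there exists a $\beta \in (0,1]$ such that
$(F \Omega^{-1} F^{\top} + Y)^{-1} \succeq \beta F^{-\top} \Omega F^{-1}$ for any $0 \prec \Omega \preceq \bar{\Omega}$.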
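
A scalar instance may help in reading Lemma 5. The sketch assumes the lemma takes the form $(F \Omega^{-1} F^{\top} + Y)^{-1} \succeq \beta F^{-\top} \Omega F^{-1}$ for all $0 \prec \Omega \preceq \bar{\Omega}$; the function names and numbers are hypothetical. In the scalar case the left-hand side equals $\omega / (f^2 + y\omega)$, so the inequality holds exactly when $\beta \le f^2 / (f^2 + y\omega)$, and the smallest such ratio over the admissible range occurs at $\omega = \bar{\omega}$.

```python
def lemma5_beta(f, y, omega_bar):
    """Largest scalar beta that works for every 0 < omega <= omega_bar."""
    return f * f / (f * f + y * omega_bar)


def predicted_information(f, y, omega):
    """Scalar form of (F * omega^-1 * F + Y)^-1."""
    return 1.0 / (f * f / omega + y)


def scaled_information(f, beta, omega):
    """Scalar form of beta * F^-1 * omega * F^-1."""
    return beta * omega / (f * f)


# Hypothetical values: f = 2, y = 3, omega_bar = 4.
f, y, omega_bar = 2.0, 3.0, 4.0
beta = lemma5_beta(f, y, omega_bar)  # 4 / (4 + 12) = 0.25
gaps = [predicted_information(f, y, w) - scaled_information(f, beta, w)
        for w in (0.5, 1.0, 2.0, 4.0)]
```

Every entry of `gaps` is nonnegative, and the gap closes at $\omega = \bar{\omega}$, which shows that the bound is tight at the upper end of the admissible range.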

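Theorem 2: Suppose that Assumption 1 and Eq. (12)
are satisfied. Also suppose that the state transition matrix $F$
is nonsingular. Then for each agent $i \in \mathcal{V}$, there exist $\bar{k} \in \mathbb{Z}_+$
and $\bar{P}_i \succ 0$ such that $P_{i,k|k} \preceq \bar{P}_i$, $\forall k \ge \bar{k}$, if there exists
at least one strongly connected subgraph $\mathcal{G}_s = (\mathcal{V}_s, \mathcal{E}_s)$ with
joint observability, where $\mathcal{V}_s \subseteq \mathcal{V}$ and $\mathcal{E}_s = \{(i,j) \mid (i,j) \in
\mathcal{E},\ i,j \in \mathcal{V}_s\}$, that has a directed path from $\mathcal{V}_s$ to agent $i$.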

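Proof: As the proof follows the framework of Theorem
3 in [10], only the part contributed by this paper will be
shown in detail. As shown in Theorem 1, if (12) is satisfied,
then for any $k \in \mathbb{Z}_+$, $P_{i,k|k}$ is lower bounded by the true
error covariance and is positive definite. Therefore, by Lemma 5, there exists
$\beta \in (0,1]$ such that
$(F P_{i,k|k} F^{\top} + B Q B^{\top})^{-1} \succeq \beta F^{-\top} P_{i,k|k}^{-1} F^{-1}$.
Let $D_k$ be the stochastic matrix
such that $\{D_k\}_{(i,j)} = d^k_{i,j}$, where $d^k_{i,j} > 0$ if $j \in \mathcal{J}_i$ and
$d^k_{i,j} = 0$ if $j \notin \mathcal{J}_i$, $\forall i,j$. It follows that

$$P_{i,k|k}^{-1} = S_{i,k} + \sum_{j \in \mathcal{J}_i} d^k_{i,j} (F P_{j,k-1|k-1} F^{\top} + B Q B^{\top})^{-1} \succeq S_{i,k} + \beta \sum_{j \in \mathcal{J}_i} d^k_{i,j} F^{-\top} P_{j,k-1|k-1}^{-1} F^{-1}. \quad (17)$$
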


Define = DqDq^\ ■Dp for some q > p, where {•} is 
an element-wised ceil function. Note that Dp has the same 
structure with D| for any k G Z+. Suppose that k>2, 
using the recursive inequality in ( [TT] ), it follows that P. 

+ Where 


Fi=Si.k+tP"^'L{^lr+2}iiJ)F^ 


t=2 




1-T 




t=2 






The last inequality is due to the fact that 5, ^ A 0 and 
PJ^ = 0, Vj G 07. For simplicity, suppose that 


Fig. 1. Communication topology for simulation. T: target; Dashed circle:
agents with partial observability; Solid circle: agents with full observability;
Dotted circle: agents not observing the target; Red: naive agent.


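$\mathcal{V}_s$ has a directed path to every other agent.¹ Thus Lemma 4
can be used.² Therefore, there exists $\bar{\tau}$ such that if $\tau > \bar{\tau}$,
every $(i,j)$ entry of the product $D_k D_{k-1} \cdots D_{k-\tau+2}$ is positive, $\forall i \in \mathcal{V}$ and $j \in \mathcal{V}_s$. Recall that $\beta > 0$ and
$R_j \succ 0$ for every non-naive agent $j$. It follows that if the agents in $\mathcal{V}_s$ have joint
observability, $\Gamma_i \succ 0$ for any $k \ge \bar{k} = \bar{\tau} + n$. Let $\bar{P}_i = \Gamma_i^{-1}$.
It follows that $\bar{P}_i \succ 0$ and $\bar{P}_i \succeq P_{i,k|k}$. ∎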

Theorem 2 formulates a sufficient condition for the
local posterior estimate errors to be bounded. Essentially, the local
estimate error at a certain agent is bounded as long as the
agent is able to eventually obtain the knowledge about the
entire state from somewhere.

The strategies to decrease the upper bound of $P_{i,k|k}$ for
each $i \in \mathcal{V}$ can also be observed from the proof of Theorem
2. Note that $P_{i,k|k}^{-1} \succeq \Gamma_i \succ 0$ with $\Gamma_i$ defined in the proof of
sufficiency. This implies that the upper bound of $P_{i,k|k}$ can be
decreased if $\Gamma_i$ is increased. This is achieved if: 1) there are
fewer naive agents so that the summation with respect to $j$
has more positive terms; 2) the graph is denser so that for a
certain $k$ the summation with respect to $\tau$ has more positive
terms; 3) local measurements are more accurate (smaller $R_j$)
or redundant ($H_j$ with more rows) so that each term in the
summation has a greater value.

VII. Simulations 

In the simulation, the target follows the linear dynamic
system considered in this paper. The parameters are adopted from the
simulation example used in [10], and listed as follows:

$$F = \begin{bmatrix} I_2 & \Delta_T I_2 \\ 0 & I_2 \end{bmatrix}, \quad Q = \begin{bmatrix} 5\Delta_T^3 I_2/3 & 5\Delta_T^2 I_2/2 \\ 5\Delta_T^2 I_2/2 & 5\Delta_T I_2 \end{bmatrix},$$

where $\Delta_T = 4$ is the sampling interval. A distributed sensor
network with $N = 10$ agents, whose directed communication
topology is shown in Figure 1, is used to track the state of
the target. Let 7/2 = [1 0 0 0], = Hj = [0 1 0 0] and 

Hi = [hJ hJ ]^. Correspondingly, let R 2 = Rj = Ri = 225, 
Pi = 225 I 2 . Also let Hi = 0 and Pu* =0 for the other 
agents. Note that there exists no directed spanning tree in this 
topology. However, the conditions formulated in Theorem 2
are satisfied.
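
The block structure of the model matrices can be assembled programmatically. The following is a minimal sketch that assumes the constant-velocity form $F = [I_2,\ \Delta_T I_2;\ 0,\ I_2]$ and $Q = 5\,[\Delta_T^3 I_2/3,\ \Delta_T^2 I_2/2;\ \Delta_T^2 I_2/2,\ \Delta_T I_2]$ with $\Delta_T = 4$, as read from the parameter list above. The helper `kron_with_identity2` and the variable names are hypothetical.

```python
def kron_with_identity2(block):
    """Kronecker product of a 2x2 matrix with I_2, returned as a 4x4 nested list."""
    out = [[0.0] * 4 for _ in range(4)]
    for r in range(2):
        for c in range(2):
            for d in range(2):
                # Each scalar block[r][c] becomes block[r][c] * I_2.
                out[2 * r + d][2 * c + d] = float(block[r][c])
    return out


DELTA_T = 4.0    # sampling interval, as stated in the text
INTENSITY = 5.0  # scalar factor in front of every block of Q (assumed reading)

F = kron_with_identity2([[1.0, DELTA_T],
                         [0.0, 1.0]])
Q = kron_with_identity2([
    [INTENSITY * DELTA_T ** 3 / 3, INTENSITY * DELTA_T ** 2 / 2],
    [INTENSITY * DELTA_T ** 2 / 2, INTENSITY * DELTA_T],
])

# Observation row of an agent that measures only the first state component.
H_FIRST_COMPONENT = [1.0, 0.0, 0.0, 0.0]
```

With this ordering the first two state components are positions and the last two are velocities, so an observation row such as `[1, 0, 0, 0]` measures a single position coordinate. Such an agent is only partially observing on its own.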

The proposed DHIF algorithm is implemented, where the 
weights in Step of Algorithm 1 are solved from Q. The
KLA algorithm and the ICF are selected for comparisons.

¹ When there are multiple disjoint strongly connected subgraphs, the
subgraphs can be treated separately.

² Note that although the first condition in Theorem 2 does not assume that
there is no edge from an agent outside $\mathcal{V}_s$ to an agent in $\mathcal{V}_s$, this can always be
guaranteed by including every agent that has a directed path to some agent in $\mathcal{V}_s$ into
the set $\mathcal{M}$ of Lemma 4. Therefore, Lemma 4 can be applied.














Fig. 2. 3σ-bound given by each algorithm: x-position. (a) Agent 1. (b) Agent 6. Horizontal axis: time instant k.



Fig. 3. Comparisons of RMSE vr* = T IJ=i s E/li i^Mk ~ 4^' 
obtained by each algorithm, where the superscript “{])” is the trial index 


As the selection of weights was not specifically mentioned
in [10], we let all weights of agent $i$ equal $1/|\mathcal{J}_i|$, $\forall i$. The rate parameter $\epsilon$ used in
the ICF is selected to be the same as that in [9], i.e., $\epsilon =
0.65/\Delta_{\max}$. A hypothetical centralized Kalman filter (CKF), whose
observation matrix has the form of $H = \mathrm{col}\{H_i\}_{i=1}^{N}$, is used
as the benchmark.

The local absolute posterior estimate errors of agent 1
(locally observable) and agent 6 (very naive), obtained by
each algorithm, are plotted, respectively, in Figure 2(a) and
Figure 2(b). The 3σ-bound corresponding to each algorithm,
with σ being the standard deviation (STD) computed from
the approximated error covariance by the algorithm, is also
plotted in the same color but with a dashed line. As observed in
Figure 2(a), the posterior estimate error of the ICF exceeds its
3σ-bound frequently. Actually, its 3σ-bound is even less than
the one obtained by the CKF, which provides the minimum
possible MSE. Therefore, the ICF can be overconfident when
the agents communicate with each other only once before
updating their local estimates, as analyzed in Section V-C.

In both Figure 2(a) and Figure 2(b), the σ value computed
by the KLA is around 1.5 times that computed by the
DHIF.
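
The 3σ comparison above can be turned into a simple numerical check. This is a sketch only: the function name and the sample numbers are hypothetical and are not results from the paper. It counts how often the absolute estimate error of one state component exceeds three times the standard deviation implied by the approximated error variance.

```python
import math


def three_sigma_violation_rate(errors, variances):
    """Fraction of time steps at which |error| exceeds the 3-sigma bound.

    errors[k] is the estimate error of one state component at step k and
    variances[k] is the matching diagonal entry of the approximated error
    covariance at that step.
    """
    if len(errors) != len(variances) or not errors:
        raise ValueError("need two non-empty sequences of equal length")
    violations = sum(
        1 for err, var in zip(errors, variances)
        if abs(err) > 3.0 * math.sqrt(var)
    )
    return violations / len(errors)


# Hypothetical data: variance 4 gives a 3-sigma bound of 6 at every step.
rate = three_sigma_violation_rate([1.0, -7.0, 2.0, 0.5], [4.0, 4.0, 4.0, 4.0])
```

For Gaussian errors and a consistent filter this rate should stay well below one percent, so a rate far above that level is a practical symptom of overconfidence.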


The same simulation as the previous one is further
run for 500 trials. The root mean squared errors
(RMSE) in estimating two state components,
for $k \in [1, 70]$, are plotted in Figure 3. As observed, the
proposed DHIF has the lowest RMSE among all distributed
algorithms.
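
A Monte Carlo RMSE of this kind can be computed as sketched below. The sketch uses one common definition, the square root of the trial-averaged squared error norm at each time step, which is an assumption and may differ in detail from the exact expression used for Figure 3. The function name and the sample data are hypothetical.

```python
import math


def rmse_per_step(errors):
    """Root mean squared error at each time step, averaged over trials.

    errors[l][k] is the estimate error vector (a sequence of floats)
    of trial l at time step k.
    """
    n_trials = len(errors)
    n_steps = len(errors[0])
    result = []
    for k in range(n_steps):
        mean_squared = sum(
            sum(component * component for component in errors[l][k])
            for l in range(n_trials)
        ) / n_trials
        result.append(math.sqrt(mean_squared))
    return result


# Two hypothetical trials, two time steps, two state components each.
sample_errors = [
    [(3.0, 4.0), (0.0, 0.0)],
    [(0.0, 0.0), (1.0, 0.0)],
]
rmse = rmse_per_step(sample_errors)  # [sqrt(12.5), sqrt(0.5)]
```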


VIII. Conclusions 

The problem of distributed state estimation of a linear
dynamic process in wireless sensor networks has been
considered. A distributed hybrid information fusion (DHIF)
algorithm has been proposed. In the proposed algorithm,
each agent communicates with its neighbors only once
before updating its local estimate and approximated error
covariance. The consistency of each
estimate is guaranteed to be preserved. Meanwhile, the
confidence of each local estimate is improved by efficiently
utilizing all information sources available to the local agent.
The proposed algorithm is fully distributed and robust against
the presence of naive agents. A sufficient condition has
been formulated to guarantee that the local estimate error
is bounded at steady state.